Fast Text Access Methods for Optical and Large Magnetic Disks: Design and Performance Comparison
نویسندگان
چکیده
High capacity disks, especially optical ones, are commercially available. These disks are ideal for archiving large text data bases. In this work, we examine efficient searching techniques for such applications. We propose a unifying framework, which reveals the similarities between signature files and an inverted file using a hash table. Then, we design methods that combine the ease of insertion of the signature files with the fast retrieval of the inverted files. We develop analytical models for their performance and we verify it through experimentation on a 2.8 Mb data base. The agreement between theory and experimentation is very good. The results show that the proposed methods achieve fast retrieval, they require a modest lo%-30% space overhead, (as opposed to 50%-300% overhead [13] for the inverted files), and they do not require rewriting; thus, they can handle insertions easily, they permit searches during an insertion and they can be used with write-once optical disks. Using our verified model, the performance predictions for the proposed methods on large data bases (e.g., 250 Mb) are very promising.
منابع مشابه
Fast Text Access Methods for Optical and Large Magnetic Disks: Designs and Performance Comparison
High capacity disks, especially optical ones, are commercially available. These disks are ideal for archiving large text data bases. In this work, we examine efficient searching techniques for such applications. We propose a unifying framework, which reveals the similarities between signature files and an inverted file using a hash table. Then, we design methods that combine the ease of inserti...
متن کاملArchiving Techniques for Temporal Databases
This paper describes archiving strategies for append-only temporal databases. We present a storage architecture where optical disks work in tandem with magnetic disks. Magnetic disks are used for storing current versions and recent past versions, whereas optical disks are dedicated for archiving older past versions. Similarly, temporal access structures are stored on both magnetic and optical d...
متن کاملThe Future of Mass Storage Systems - Guest Editor's Introduction
the Sixth IEEE Symposium on Mass Storage Systems held at Vail, Colorado, in June 1984. To owners of large collections of data at the symposium, the future looked bright indeed. There were descriptions of new highperformance optical digital data disk products,'-3 descriptions of new very high density magnetic tape4 and magnetic disk formats, and a discussion of the design decisions leading up to...
متن کاملParallel file striping on optical jukebox servers
In the near future, large digital media servers are expected to offer storage capacities in the order of petabytes. Servers made of clusters of PC's connected to jukeboxes may represent an interesting alternative compared with servers made of arrays of magnetic disks. However, due to disk exchange overhead, higher seek times and lower data transfer rates, access to data located on optical disks...
متن کاملChallenges for Tertiary Storage in Multimedia Servers
The low cost per megabyte of optical disk and magnetic tape storage make these technologies particularly attractive for use in large capacity storage servers, including multimedia servers. However, these devices have performance problems that range from high costs for many optical drives to low performance and lack of random access in tape drives. We evaluate the performance on multimedia appli...
متن کامل